Thematic Analysis and Visualization of Textual Corpus

نویسندگان

  • Anja Habacha Chaïbi
  • Ferihane Kboubi
  • Mohamed Ben Ahmed
چکیده

The semantic analysis of documents is a domain of intense research at present. The works in this domain can take several directions and touch several levels of granularity. In the present work we are exactly interested in the thematic analysis of the textual documents. In our approach, we suggest studying the variation of the theme relevance within a text to identify the major theme and all the minor themes evoked in the text. This allows us at the second level of analysis to identify the relations of thematic associations in a textual corpus. Through the identification and the analysis of these association relations we suggest generating thematic paths allowing users, within the frame work of information search system, to explore the corpus according to their themes of interest and to discover new knowledge by navigating in the thematic association relations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thematic Organization in MA TEFL Students' Argumentative, Cause and Effect, and Process Types of Writing

It is generally recognized that many second language learners have difficulties with cohesion in academic texts.  Writing seems to be the most difficult subject for many students. To produce good writing, it is necessary to know how to organize Theme and Rheme in a text. Thematic structure as an important feature in textual metafunction plays a significant role in promoting the textual coherenc...

متن کامل

Cultural Elements in the English Translations of the Iranian ‘Resistance’ Literature: A Textual, Paratextual, and Semiotic Analysis

The present corpus-based study addressed the strategies applied in translating the cultural elements (CEs) of the Iranian ‘resistance’ literature into English. The corpus comprised Chess with the Doomsday Machine, Eternal Fragrance, and Fortune Told in Blood translated by Sprachman, Omidvar, and Ghanoonparvar, re- spectively. The Persian books and their English translations were analyzed on thr...

متن کامل

Semantic Visualization and Navigation in Textual Corpus

This paper gives a survey of related work on the information visualization domain and study the real integration of the cartography paradigms in actual information search systems. Based on this study, we propose a semantic visualization and navigation approach which offer to users three search modes: precise search, connotative search and thematic search. The objective is to propose to the user...

متن کامل

The Use of Lexical Bundles in Native and Non-native Post-graduate Writing: The Case of Applied Linguistics MA Theses

Connor et al. (2008) mention “specifying textual requirements of genres” (p.12) as one of the reasons which have motivated researchers in the analysis of writing. Members of each genre should be able to produce and retrieve these textual requirements appropriately to be considered communicatively proficient. One of the textual requirements of genres is regularities of specific forms and content...

متن کامل

Modelling the flow of discourse in a corpus of written academic English

Discourse studies attempt to describe how context affects text, and how text progresses from one sentence to the next. Systemic Functional Linguistics (SFL) offers a model of language to describe how information flow varies according to context and co-text through the Textual metafunction, especially using the functions of Participant Identification and Tracking, Theme and Information Structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1112.2071  شماره 

صفحات  -

تاریخ انتشار 2011